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In accordance with certain preferred embodinnents of this aspect of the invention, there are provided products, 
corripositions and rr^ethods for (i) assessing aroE expression, (ii) treating disease for exannple, otitis nnedia, con|unc- 
tivitis pneumonia bacteremia, meningitis, sinusitis, pleural empyema and endocarditis, and most particularly menin- 
oitis such as for example infection of cerebrospinal fluid., (iii) assaying genetic variation, (iv) and administering a aroE 
poiypeptide or polynucleotide to an organism to raise an immunological response against a bacteria, especially a 
Streptococcus pneumoniae bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the invention there are provided 
Dolvnucleotides that hybndize to aroE polynucleotide sequences, particularly under stringent conditions. 

In certain preferred embodiments of this aspect of the inventbn there are provided antibodies against aroE polypep- 
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.n accordance with another aspect of the invention, there are provided aroE agonists and antagonists each of 
which are also preferably bacteriostatic or bacteriocidal. 

In a further aspect of the invention there are provided compositions comprising a aroE polynucleotide or a arob 
Dolvoeptide for administration to a cell or to a multicellular organism. 
IS Various changes and modifications within the spirit and scope of the disclosed invention will become readily ap- 

parent to those skilled in the art from reading the following descriptions and from reading the other parts of the present 
disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 

The following drawings depict certain embodiments of the invention. They are illustrative only and do not limit the 
invention otherwise disclosed herein. ^ ,_ 

Fiaure 1 shows the polynucleotide sequence of Streptococcus pneumoniae mo£ [SEQ ID NO:1]. 
Figure 2 shows the amino acid sequence of Streptococcus pneumonme aroE [SEQ ID NO:2] deduced from the 
25 polynucleotide sequence of Figure 1 . 

GLOSSARY 

The following definitions are provided to facilitate understanding of certain terms used frequently herein 
30 "Host cell- is a cell which has been transformed or transfected, or is capable of transformation or transfection by 

an exogenous polynucleotide sequence. 

•Identity " as known in the art, is a relationship between two or more polypeptide sequences or two or more poly- 
nucleotide sequences, as determined by comparing the sequences. In the art, -identity" also means the degree of 
seauence relatedness between polypeptide or polynucleotide sequences, as the case may be, as determined by the 
35 match between strings of such sequences. "Identity" and "similarity" can be readily calculated by known methods 
(Computational MolBcularBiology. Lesk. A.M.. ed.. Oxford University Press. New York, 1 988; Biocomputing: miorrnaUcs 
and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993, Computer Analysis of Sequence Data Part 
1 Griffin A M and Griffin H G , eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology 
vonHeinie G Academic Press, 1 987: and Sequence Ana/ys/s Pnmer, Gribskov, M. and Devereux, J , eds., M Stockton 
40 Press New York 1 991). While there exist a number of methods to measure identity and similarity between two se- 
nuences both terms are well known to skilled artisans (Sequence Analysis in Molecular Biology von Heinie, G ^Ac- 
ademic Press 1987 Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds,, M Stockton Press, New York, 
1991 and Cahllo H and Lipman. D.. SI AM J. Applied Math., 48: 1073 (1988). Methods commonly employed to de- 
termine Identity or similarity between sequences include, but are not limited to those disclosed in Canllo. H. . and Lipman, 
45 D SIAM J Applied Math , 48:1073 (1988). Preferred methods to determine identity are designed to give the largest 
match between the sequences tested. Methods to determine identity and similarity are codified in publicly available 
computer programs. Preferred computer program methods to detemi.ne identity and similarity between two sequences 
include but are not limited to, GCG program package (Devereux, J , el a! , Nucleic Acids Research 12(1). 387 (1984)), 
BLASTP BLASTN and FASTA (Atschul, S.F et al , J Molec. Bioi 215 403-410 (1990)) The BLAST X program ,s 
50 publfcly available from NCBI and other sources (BLAST Manual. Altschul, S., et al.. NCBI NLM NIH Bethesda. MD 
20804 Altschul.S .efa/.. J Wo/. S;o/. 275; 403-410 (1990)). u • 

■isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs in nature, it has been cnanged 
or removed from its original environment, or both For example, polynucleotide or a polypeptide naturally present in a 
living organism is not "isolated," but the same polynucleotide or polypeptide separated from the coexisting materials 
55 of its natural state is "isolated", as the term is employed herein. 

-Poiynucleotide(s)" generally refers tcany polyribonucleotide or polydeoxribonucleotide, which may be unmodified 
RNA or DN A or modified RNA or DNA •Polynucleotide(s)" include, without limitation, single-and double-stranded DNA 
DNA that is a mixture of single- and double-stranded regions or single-, double- and triple-stranded regions, single- 
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. K ^,nn acd seauence homology to known Eschanchm col, Sh.kimale 5-Dehydrogenase polypeptide 
wh.ch ,s related Z nucleotide and am,no acd sequences set out .n F.gure 1 .nd Figure 

TrtiXStd"^^^^^^^^^^ °' ^^'^^ "^''^ " 

^^^TiLCsl 'r:SL,e to evaluate te.pora, -^^^^^^^^^^^ 
under ..oratory and .osnn^^^^^^^ 

to survival per '^'^J^^^^^lf^^^^^ ,n,orma„on about ns funcfon and assists in the selection of 

:::Z;Z:e Z:n::i::!^:r:as . s.een., target. Br,e,,, these approaches include ,or example: 

1) Signature Tagged Mutagenesis (STM) 

■ . 1 o^,>,„^^ofiQ 4nn-4f)3(l995^ the contents of which is incorporated 

sucn ma, un„ue DNA sequence »gs a,e insenea " P'™'™^. » '^^^1° ,„p«„,(on. nadiola- 

bacteria recovered from irifecled hosts. <.v<:;tpm is less well developed, a more efficient way of 

r -vri^L^^^^^^^ - ~ -v Mo,„eon „ 

rS*! So S « coolen,. =, i. .ncon=o«,d ,e„™nc. ,0, ^acKgroond pu,poeee, 
25 2) In Vivo Expression Technology (IVET) 

, A/ A^r^ c^r^i nc;A 01-2634-2638 (1994) and Mahan ef a/., 
This technique, described by Ca^..^^^^^^^^^ 

Infectious Agents and Diseases 2.263-268 ^^^^f ^^^^ corripared to laboratory cultivation, implying 

-rr==— — ^^^^ 

geneinaplasmidvector.Thepool sintr^^^^^^ chromosomal fragment carried upstream of an ex- 

Upstream of the reporter gene allows identification of the up regulated gene. 
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3) Differential display 



pcaledby reference to, bac»9.ound ''''^^T^R^TS^^' f'"™"' 

r^rnrd::n^:;rdS^^^^^^^ 



genes 
library sequences 



4) Generation of conditional lethal mutants by transposon mutagenesis. 

V/ , , ^^noi9-^ 17-24 (1993V Neuwald, A F. eJ a/.. Gene 125. 69-73 

in this technique transposons carrying controllable promoters, wnicn pro organisms and 

poson in one or both directions are ^-erated^ Ban^^^^^^^^^^^ :ro.r artronrirth'at inso^ions which 

subsequent isolation of insertion mutants in ^/^^^^^^^^ ,or cell viability will be recovered. Sub- 

separate promoter from coding region of gene to sun/ive Sequencing of 
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Polypeptides 



The polypeptides of the invention include the polypeptide of Figure 2 [SEQ ID NO;2 ] (in particular the mature 
DOlvDeplide) as well as polypeptides and fragments parlicularly those which have the biological activity of aroE and 
5 also those which have at least 70% identity to the polypeptide of Figure 2 [SEQ ID NO 2] or the relevant portion, 
preferably at least 80% identity to the polypeptide of Figure 2 [SEQ ID NO 2], and more preferably at least 90% similarity 
(more preferably at least 90% identity) to the polypeptide of Figure 2 [SEQ ID NO;21 and still more preferably at least 
95% similarity (still more preferably at least 95% identity) to the polypeptide of Figure 2 [SEQ ID NO;2] and also include 
portions of such polypeptides with such portion of the polypeptide generally containing at least 30 amino acids and 
10 more preferably at least 50 ammo acids. 

Variants that are fragments of the polypeptides of the invention may be employed for producing the corresponding 
full-length polypeptide by peptide synthesis; therefore, these variants may be employed as intermediates for producing 
the full-length polypeptides. Variants that are fragments of the polynucleotides of the invention may be used to syn- 
thesize full-length polynucleotides of the invention. 
75 A fragment is a variant polypeptide having an amino acid sequence that entirely is the same as part but not all of 

the amino acid sequence of the aforementioned polypeptides As with aroE polypeptides fragments may be -free- 
standing," or comprised within a larger polypeptide of which they form a part or region, most preferably as a single 
continuous region, a single larger polypeptide. 

Prelerred fragments include, for example, truncation polypeptides having a portion of the amino acid sequence ot 
20 Fiqure 2 [SEQ ID NO 2] or of variants thereof, except for deletion of a continuous series of residues that includes the 
amino terminus or a continuous series of residues that includes the carboxyl terminus or deletion of two continuous 
series of residues one including the amino terminus and one including the carboxyl temiinus. Degradation forms of 
the polypeptides of the invention in a host cell, particularly a Streptococcus pneumoniae, are also preferred. Also 
preferred are fragments characterized by structural or functional attributes such as fragments that comprise alpha- 
2S helix and alpha-helix forming regions, beta-sheet and beta-sheot-forming regions, turn and turn-forming regions, coil 
and coil-forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta amphipathic re- 
gions flexible regions, surface-forming regions, substrate binding region, and high antigenic index regions. 

Also preferred are biologically active fragments which are those fragments that mediate activities of aroE, including 
those with a similar activity or an improved activity or with a decreased undesirable activity. Also included are those 
30 fragments that are antigenic or immunogenic in an animal, especially in a human 

Polynucleotides 

Another aspect of the invention relates to isolated polynucleotides which encode the aroE polypeptide having the 
35 deduced amino acid sequence of Figure 2 [SEQ ID NO;2] and polynucleotides closely related thereto and variants 
thereto. 

Using the information provided herein, such as the polynucleotide sequence set out in 

40 Figure 1 [SEQ ID I^O 1], a polynucleotide of the invention encoding aroE polypeptide may be obtained using 

standard cloning and screening, such as those for cloning and sequencing chromosomal DNA fragments from Strep- 
tococcus pneumor7/ae 0100993 cells as starting material, followed by obtaining a full length clone. For example to 
obtain a polynucleotide sequence of the invention, such as that sequence given in Figure 1 [SEQ ID NO 1), typically 
a library of clones of chromosomal DNA of Streptococcus pneumoniae 01 00993 in E.coli or some other suitable host 
45 IS probed with a radiolabeled oligonucleotide, preferably a 1 7-mer or longer, derived from a partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using stringent conditions By sequencing the 
individual clones thus identified with sequencing primers designed from the original sequence it is then possible to 
extend the sequence in both directions to determine the full gene sequence. Conveniently such sequencing is per- 
formed usinq denatured double stranded DNA prepared from a plasmid clone Suitable techniques are described by 
so Maniatis T Fritsch, E.F and Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL. 2nd Ed.- Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989). (see Screening By Hybridizatior. 1 9C and 
Sequencing Denatured Double-Stranded DNA Templates 13.70). Illustrative of the invention, the polynucleotide set 
out in Figure I [SEQ ID NO: 1] was discovered in a DNA library derived from Sfropfococct/s pnoumon/ao 0100993 

The DNA sequence thus obtained is set out in Figure 1 [ SEQ ID NO 1 ] It contains an open reading frame encoding 
a pro'eir having about the number of am ino acid residues set forth in Figure 2 [SEQ ID NO:21 with a deduced molecular 
weight that can be calculated using amino acid residue molecular weight values well known in the art aroE of the 
invention is stmcturally related to other proteins of the aromatic ammo acid biosynthesis family as shown by the results 
of sequencing the DNA encoding aroE of the deposited strain The protein exhibits greatest homology to Escherichia 
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have a high sequence similarity to the aroE gene Such probes generally will connprise at least 15 bases Preferably, 
such probes will have at least 30 bases and may have at least 50 bases Particularly preferred probes will have at least 
30 bases and will have 50 bases or less 

For example the coding region of the aroE gene may be isolated by screening using the known DNA sequence 

5 to synthesize an oligonucleotide probe A labeled oligonucleotide having a sequence complementary to that of a gene 
of the invention is then used to screen a library of cDNA, genomic DNA or mRNA to determine which members of the 
library the probe hybridizes to 

The poiynucleotides and polypeptides of the invention may be employed as research reagents and materials for 
discovery of treatments of and diagnostics for disease, particularly human disease, as further discussed herein relating 

10 to polynucleotide assays, inter alia. 

Polynucleotides ofthe invention that are oligonucleotides derived from the sequences of SEQ ID NCS: 1 and 2 
may be used in the processes herein as described, but preferably for PGR, to determine whether or not the polynu- 
cleotides identified herein in whole or in part are transcribed in infected tissue. It is recognized that such sequences 
will also have utility in diagnosis of the stage of infection and type of infection the pathogen has attained. 

IS The polynucleotides may encode a polypeptide which is the mature protein plus additional amino or carboxyl- 

terminal amino acids, or ammo acids interior to the mature polypeptide (when the mature fornn has more than one 
polypeptide chain, for instance). Such sequences may play a role in processing of a protein from precursor to a mature 
form, may allow protein transport, may lengthen or shorten protein half-life or may facilitate manipulation of a protein 
lor assay or production, among other things. As generally is the case in vivo, the additional amino acids may be proc- 

20 essed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more prosequences may be an 
inactive form of the polypeptide. When prosequences are removed such inactive precursors generally are activated. 
Some or all of the prosequences may be removed before activation. Generally, such precursors are called proproteins. 
In sum, a polynucleotide of the invention may encode a mature protein a mature protein plus a leader sequence 

25 (which may be referred to as a preprotein), a precursor of a mature protein having one or more prosequences which 
are not the leader sequences of a preprotein, or a preproprotein, which is a precursor to a proprotein, having a leader 
sequence and one or more prosequences, which generally are removed during processing steps that produce active 
and mature forms of the polypeptide. 

30 Vectors, host cells, expression 

The invention also relates to vectors which comprise a polynucleotide or polynucleotides of the invention, host 
cells which are genetically engineered with vectors of the invention and the production of polypeptides of the invention 
by recombinant techniques Cell-free translation systems can also be employed to produce such proteins using RNAs 

35 derived from the DNA constructs of the invention. 

For recombinant production, host cells can be genetically engineered to incorporate expression systems or portions 
thereof or polynucleotides of the invention. Introduction of a polynucleotide into the host cell can be effected by methods 
described in many standard laboratory manuals, such as Davis et al., BASIC METHODS IN MOLECULAR BIOLOGY, 
(1986) and Sambrook et at., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed , Cold Spring Harbor Lab- 

40 oratory Press, Cold Spring Harbor. N.Y. (1989), such as, calcium phosphate transfection, DEAE-dextran mediated 
transfection, transvection, microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape 
loading, ballistic introduction and infection. 

Representative examples of appropriate hosts include bacterial ceils, such as streptococci, staphylococci, E. coli, 
streptomyces and Baallus subtifis cells: fungal cells, such as yeast cells and Aspergillus cells: insect cells such as 

45 Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS. HeLa, CI 27, 3T3, BHK, 293 and Bowes 
melanoma cells: and plant cells. 

A great variety of expression systems can be used to produce a polypeptide of the invention. Such vectors include, 
among others, chromosomal, episomal and virus-derived vectors, e.g., vectors derived from bacterial plasmids, from 
bacteriophage, from transposons. from yeast episomes, from insertion elements, from yeast chromosomal elements, 

50 from viruses such as baculoviruses. papova viruses, such as SV40. vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosm.ids and phagemids. The expression system constructs 
may contain control regions that regulate as well as engender expression. Generally any system or vector suitable to 
maintain, propagate or express polynucleotides and/or to express a polypeptide in a host may be used for expression 

55 in this regard The appropriate DNA sequence may be inserted into the expression system by any of a variety of well- 
known and routine techniques, such as, for example, those set forth in Sambrook et al., MOLECULAR CLONING, A 
LABORATORY MANUAL, (supra) 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the penplasmic space or 
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such as. for example, amplification. PGR. RT-PCR, RNase protection, Northem blotting and other hybridization meth- 
ods 

In addition, a diagnostic assay in accordance with the invention for detecting over-expression of aroE protein 
compared to normal control tissue samples may be used to detect the presence of an infection, for example Assay 
techniques that can be used to determine levels of a aroE protein, in a sample derived trom a host are well-known to 
those of skill in the art. Such assay methods include radioimmunoassays, competitive-binding assays, Western Blot 
analysis and ELISA assays. 

Antibodies 



The polypeptides of the invention or variants thereof or cells expressing them can be used as an immunogen to 
produce antibodies immunospecific for such polypeptides. "Antibodies" as used herein includes monoclonal and pol- 
yclonal antibodies, chimeric, single chain, simianized antibodies and humanized antibodies, as well as Fab fragments, 
including the products of an Fab immunolglobulin expression library. 
15 Antibodies generated against the polypeptides of the invention can be obtained by administering the polypeptides 

or epitope-beanng fragments, analogues or cells to an animal, preferably a nonhuman, using routine protocols. For 
preparation of monoclonal antibodies, any technique known in the art which provides antibodies produced by contin- 
uous cell line cultures can be used Examples include various techniques, such as those in Kohler. G. and Milstein. 
C. Nature256: 495-497(1975): Kozbor efa/., Immunology Today 4:72 (1983): Coleetal.. pg. 77-96 in MONOCLONAL 
20 ANTIBODIES AND CANCER THERAPY. Alan R. Liss, Inc. (1 985) 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be adapted to produce 
single chain antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such as other mam- 
mals may be used to express humanized antibodies. 

Alternatively phage display technology couid be utilized to select antibody genes with binding activities towards 
25 the polypeptide either from repertoires of PGR amplified v-genes of lymphocytes from humans screened for possessing 
anti-Fbp or from naive libraries (McCafferty, J. et al., (1 990). Nature 348. 552-554; Marks. J. et al.. (1 992) Biotechnology 
10, 779-783). The affinity of these antibodies can also be improved by chain shuffling (Clackson. T. et al., (1 991 ) Nature 
352, 624-628). 

If two antigen binding domains are present each domain may be directed against a different epitope - termed 
30 'bispecific* antibodies. 

The above-described antibodies may be employed to isolate or to identify clones expressing the polypeptides to 
purify the polypeptides by affinity chromatography. 

Thus, among others, antibodies against aroE may be employed to treat infections, particularly bacterial infections 
and especially otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleura! empyema and endo- 
ws carditis, and most particularly meningitis, such as for example infection of cerebrospinal fluid,. 

Polypeptide variants include antigenically, epitopically or immunologically equivalent variants which form a partic- 
ular aspect of this invention The term "antigenically equivalent derivative" as used herein encompasses a polypeptide 
or its equivalent which will be specifically recognised by certain antibodies which, when raised to the protein or polypep- 
tide according to the invention, interfere with the immediate physical interaction between pathogen and mammalian 
40 host. The term "immunologically equivalent derivative" as used herein encompasses a peptide or its equivalent which 
when used in a suitable formulation to raise antibodies in a vertebrate, the antibodies act to interfere with the immediate 
physical interaction between pathogen and mammalian host. 

The polypeptide, such as an antigenically or immunologically equivalent derivative or a fusion protein thereof is 
used as an antigen to immunize a mouse or other animal such as a rat or chicken. The fusion protein may provide 
45 stability to the polypeptide The antigen may be associated, for example by conjugation with an immunogenic earner 
protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). Alternatively a multiple anti- 
genic peptide comprising multiple copies of the protein or polypeptide, or an antigenically or immunologically equivalent 
polypeptide thereof may be sufficiently antigenic to improve immunogenicily so as to obviate the use of a carrier. 

Preferably the antibody or variant thereof is modified to make it less immunogenic in the individual. For example, 
50 if the individual is human the antibody may most preferably be "humanized-, where the complimentarity determining 
region(s) of the hybridoma-derived antibody has been transplanted into a human monoclonal antibody, for example as 
described in Jones, P et al (1986), Nature 321, 522-525 or Tempest et al ,(1991) Biotechnology 9.266,273. 

The use of a polynucleotide of the invention in genetic immunization will preferably employ a suitable delivery 
method such as direct injection of plasmid DNA into muscles (Wolff et al., Hum Mol Genet 1 992, 1 :363, Manthorpe et 
55 ai Hum. Gene Ther. 1963:4. 419). delivery of DNA complexed with specific protein carriers fWu et al.. J Biol Chem 
1989 264.16985), coprecipitation of DNA With calcium phosphate (Benventsty & Reshef, PNAS, 1986 83 9551), en- 
capsulation of DNA in various forms of liposomes (Kaneda et al . Science 1989 243.375). particle bombardment (Tang 
eta! Nature 1992. 356 152, Eisenbraun et al . DNA Cell Biol 1993. 12:791) and /n wVo infection using cloned retroviral 
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produce antibody to protect said individual from infection, particularly bacterial infection and most particularly Strepto- 
coccus pneumoniRe infections Yet another aspect of the invention relates to a method of inducing innmunological 
response in an individual which compnses. through gene therapy, delivering gene encoding aroE, or afragnnent or a 
variant thereof, for expressing aroE or a fragment or a variant thereof in vivo in order to induce an inamunological 

5 response to produce antibody to protect said individual from disease. 

A further aspect of the invention relates to an immunological composition which, when introduced into a host 
capable or having induced within it an immunological response, induces an immunological response in such host to a 
aroE or protein coded therefrom, wherein the composition comprises a recombinant aroE or protein coded therefrom 
comprising DNA which codes for and expresses an antigen of said aroE or protein coded therefrom. 

10 The aroE or a fragment thereof may be fused with co-protein which may not by itself produce antibodies, but is 

capable of stabilizing the first protem and producing a fused protein which will have immunogenic and protective prop- 
erties Thus fused recombinant protein, preferably further comprises an antigenic co-protein, such as Glutathione-S- 
transferase (GST) or beta-galactosidase, relatively large co-proteins which solubilisethe protein and facilitate produc- 
tion and purification thereof. Moreover, the co-protein may act as an adjuvant in the sense of providing a generalized 

?5 stimulation of the immune system. The co-protein may be attached to either the amino or carboxy terminus of the first 
protein. 

Provided by this, invention are compositions, particularly vaccine compositions, and methods compnsing the 
polypeptides or polynucleotides of the invention and immunostimulatory DNA sequences, such as those described in 
Sato, Y. etal Science 273: 352 (1996). 

20 Also, provided by this invention are methods using the described polynucleotide or particular fragments thereof 

which have been shown to encode non-variable regions of bacterial cell surface proteins in DNA constructs used in 
such genetic immunization experiments in animal models of infection with Streptococcus pneumoniae m\\ be particu- 
larly useful for identifying protein epitopes able to provoke a prophylactic or therapeutic immune response, it is believed 
that this approach will allow for the subsequent preparation of monoclonal antibodies of particular value from the req- 

25 uisitc organ of the animal successfully resisting or clearing infection for the development of prophylactic agents or 
therapeutic treatments of bacterial infection, particularly Streptococcus pneumoniae infections, in mammals, particu- 
larly humans. 

The polypeptide may be used as an antigen for vaccination of a host to produce specific antibodies which protect 
against invasion of bactena. for example by blocking adherence of bacteria to damaged tissue. Examples of tissue 
30 damage include wounds in skin or connective tissue caused e g by mechanical, chemical or thermal damage or by 
implantation of indwelling devices, or wounds in the mucous membranes, such as the mouth, mammary glands, urethra 

or vagina. 

The invention also includes a vaccine formulation which compnses the immunogenic recombinant protein together 
with a suitable carrier. Since the protein may be broken down in the stomach, it is preferably administered parenterally, 

35 including, for example, administration that is subcutaneous, intramuscular, intravenous, or intradermal. Formulations 
suitable for parenteral administration include aqueous and non-aqueous stenle injection solutions which may contain 
anti-oxidants. buffers, bacteriostats and solutes which render the formulation instonic with the bodily fluid, preferably 
the blood, of the individual: and aqueous and non -aqueous stenle suspensions which may include suspending agents 
or thickening agents. The formulations may be presented in unit-dose or multi-dose containers, for example, sealed 

40 ampoules and vials and may be stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier 
immediately prior to use. The vaccine formulation may also include adjuvant systems for enhancing the immunogenicity 
of the formulation, such as oil-in water systems and other systems known tn the art. The dosage will depend on the 
specific activity of the vaccine and can be readily determined by routine experimentation. 

While the invention has been described with reference to certain aroE, it is to be understood that this covers 

-iS fragments of the naturally occurring protein and similar proteins with additions, deletions or substitutions which do not 
substantially affect the immunogenic properties of the recombinant protein. 

Compositions, kits and administration 

50 The invention also relates to compositions comprising the polynucleotide or the polypeptides discussed above or 

the agonists or antagonists. The polypeptides of the invention may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with ceils tissues or organisms, such as a pharmaceutical carrier suitable for admin- 
istration to a subject. Such compositions comprise, for instance, a media additive or a therapeutically effective amount 
of a polypeptide of the invention and a pharmaceutically acceptable carrier or excipient. Such earners may include, 

55 but are not limited to, saline buttered saline dextrose, water, glycerol, ethanol and combinations thereof. The formu- 
lation should suit the mode of administration. The invention further relates to diagnostic and pharmaceutical packs and 
kits compnsing one or more containers filled with one or more of the ingredients of the aforementioned compositions 
of the invention 
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size-fractionated by either of two methods. 
Method 1 

5 Total cellular DNA is mechanically sheared by passage through a needle in order to size-fractionate according to 

standard procedures. DNA fragments of up to 1 Ikbp in size are rendered blunt by treatment with exonuclease and DNA 
polymerase, and EcoRI linkers added. Fragments are ligated into the vector Lambda Zapll that has been cut with 
EcoRI. the library packaged by standard procedures and E.coli infected with the packaged library. The library is am- 
plified by standard procedures. 

10 

Method 2 

Total cellular DNA is partially hydrolyzed with a one or a combination of restriction enzymes appropriate to generate 
a series of fragments for cloning into library vectors (e.g , Rsal, Pall, Alul, Bsh1235l), and such fragments are size- 
75 fractionated according to standard procedures. EcoRI linkers are ligated to the DNA and the fragments then ligated 
into the vector Lambda Zapll that have been cut with EcoRI, the library packaged by standard procedures, and E.coii 
infected with the packaged library. The library is amplified by standard procedures. 
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(2) INFORMATION FOR SEQ ID NO 1 : 

950 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 772 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
955 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

ATGAAGCTTG ATGGCTATAC ACGTTTAGCT GCAGTTGTTG CCAATCCTAT 
TAAGCATTCT 60 

ATTTCTCCCT TCATCCACAA TAGAGCCTTT GAGGCGACAG CTACCAACGG 
TGCTTATGTG 120 

GCTTGGGAGA TTGAAGCGAG TGACTTGGTA GAAACAGTGG CCAATATTCG 

965 TCGCTACCAG 180 

ATGTTTGGCA TCAATCTGTC CATGCCCTAT AAGGAGLAGG TGATTCCTTA 

TTTGGATAAG 240 

CTGAGCGATG AAGCGCGCTT GATTGGTGCG GTTAATACGG TTGTCAATGA 
GAATGGCAAT 300 
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35 40 4s 

Leu Val Glu Thr Val Ala Asn lie Arg Arg Tyr Gin Met Phe Gly lie 
1005 50 55 60 

Asn Leu Ser Met Pro Tyr Lys Glu Gin Val lie Pro Tyr Leu Asp Lys 
65 70 75 80 

Leu Ser Asp Glu Ala Arg Leu He GK Ala Val Asn Thr Val Val Asn 
85 90 95 

1010 Glu Asn Gly Asn Leu lie Gly Tyr Asn Thr Asp Gly Lys Gly Phe Phe 
100 105 110 

Lns Cns Leu Pro Ser Phe Thr lie Ser Gly Lys Lys Met Thr Leu Leu 

115 120 125 

Gly Ala Gly Gly Ala Ala Lys Ser He Leu Ala Gin Ala lie Leu Asp 
1015 130 135 140 

Gly Val Ser Gin lie Ser Val Phe Val Arg Ser Val Ser Met Glu Lys 
145 150 155 160 

Thr Arg Pro Tyr Leu Asp Lys Leu Gin Glu Gin Thr Gly Phe Lys Val 
165 170 175 

1020 Asp Leu Cys Ala Leu Glu Tyr Val Ser Glu Leu Gin Ala Arg lie Ala 
180 185 190 

Glu Ser Asp Leu Leu Val Asn Ala Thr Ser Val Gly Met Asp Gly Gin 

195 200 205 

Phe Ser Pro Val Pro Glu Asn He Val Leu Pro Glu Thr Leu Leu Val 
1025 210 215 220 

Ala Asp He lie Tyr Gin Pro Phe Glu Thr Pro Phe Leu Lys Trp Ala 
225 230 235 240 

Arg Ser Gin Gly Asn Pro Ala Val Asn Gly Leu Gly Met Leu Leu Tyr 
245 250 255 

1030 Gin 



(2) lNFORVL\TlON FOR SEQ ID N0:3: 
1035 (DSEOL-ENCE CHARACTERISTICS: 
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6. The polynucleotide of Claim 2 which encodes a polypeptide connprising ammo acid 1 to 257 of SEQ ID NO:2. 

7. An isolated polynucleotide comprising a member selected from the group consisting of: 

5 (a) a polynucleotide having at least a 70% identity to a polynucleotide encoding the same mature polypeptide 

expressed by the aroE gene contained in NCIMB Deposit No. 40794; 

(b) a polynucleotide complementary to the polynucleotide of(a); and 

(c) a polynucleotide comprising at least 15 bases of the polynucleotide of (a) or (b). 

10 8. A vector compnsing the DNA of Claim 2. 

9. A host cell comprising the vector of Claim 8. 

10. A process for producing a polypeptide comprising: expressing from the host cell of Claim 9 a polypeptide encoded 
15 by said DNA. 

1 1 A process for producing a cell which expresses a polypeptide comprising transforming or transfecting the cell with 
* the vector of Claim 8 such that the cell expresses the polypeptide encoded by the cDNA contained in the vector. 

12. A process for producing a aroE polypeptide or fragment comprising culturing a host of claim 9 under conditions 
sufficient for the production of said polypeptide or fragment. 

13. A polypeptide comprising an amino acid sequence which is at least 70% identical to amino acid 1 to 257 of SEQ 
ID NO:2. 

14. A polypeptide comprising an amino acid sequence as set forth in SEQ ID NO:2. 

15. An antibody against the polypeptide of claim 13. 

30 16. An antagonist which inhibits the activity of the polypeptide of claim 13. 

17. A method for the treatment of an individual having need of aroE comprising: administering to the individual a 
therapeutically effective amount of the polypeptide of claim 1 3. 

18 The method of Claim 16 wherein said therapeutically effective amount of the polypeptide is administered by pro- 
viding to the individual DNA encoding said polypeptide and expressing said polypeptide in wvo. 

19. A method for the treatment of an individual having need to inhibit aroE polypeptide comprising: administering to 
' the individual a therapeutically effective amount of the antagonist of Claim 16. 

20. A process for diagnosing a disease related to expression of the polypeptide of claim 1 3 comprising: 
determining a nucleic acid sequence encoding said polypeptide. 

21. A diagnostic process comprising: 
analyzing for the presence of the polypeptide of claim 13 in a sample derived from a host. 

22. A method for identifying compounds which bind to and inhibit an activity of the polypeptide of claim 1 3 comprising: 

contacting a cell expressing on the surface thereof a binding for the polypeptide, said binding being associated 
with a second component capable of providing a detectable signal in response to the binding of compound to 
said binding with a compound to be screened under conditions to permit binding to the binding; and 
determining whether the compound binds to and activates or inhibits the binding by detecting the presence 
or absence of a signal generated from the interaction of the compound with the binding. 

23 A method for inducing an immunological response in a mammal which compnses inoculating the mammal with 
aroE. or a fragment or variant thereof, adequate to produce antibody to protect said animal from disease. 

24. A method of inducing immunological response in a mammal which comprises through gene therapy delivering 
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(54) Streptococcus pneumoniae aroE polypeptides and polynucleotides 



(57) The invention provides Streptococcus pneu- 
moniae aroE polypeptides and DNA (RNA) encoding 
aroE polypetides and methods tor producing such 



polypeptides by recombinant techniques Also provided 
are methods for utilizing aroE polypeptide for the pro- 
tection against infection, particularly bacterial infec- 
tions. 
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»n.h«.,nh rlaims 17-19. 23 and 24 are directed to a method of treatment of 
Jle SSSn/aSSSl body {Article 52(4) EPC). the search has been carried 
iS? 5!3 SasSHn the alleged effects of the conpound/conposition. 
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cuims 16 and in part 19 relate to an antagonist of the polypeptide 
withwt giving a true technical characterization. Horeover no such 
JiSSSndl are defined in the application In consequence, the, scope of 

claims is ambiguous and vague, and their subject-matter is not 
ISfficiJntly discloled and supported (Art. 83 and B4 EPC). Ho search can 
be clrried out for such purely speculative claims whose wording is. m 
fact, a mere recitation of the results to be achieved. 
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